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Abstract 

This article is concerned with an optimal control problem derived by mean-field forward-backward stochastic differential 
equation with noisy observation, where the drift coefficients of the state equation and the observation equation are linear with 
respect to the state and its expectation. The control problem is different from the existing literature on optimal control for 
mean-field stochastic systems, and has more applications in mathematical finance, e.g., asset-liability management problem 
with recursive utility, systematic risk model. Using a backward separation method with a decomposition technique, two 
optimality conditions along with two coupled forward-backward optimal filters are derived. Several linear-quadratic optimal 
control problems for mean-field forward-backward stochastic differential equations are studied. Closed-form optimal solutions 
are explicitly obtained in detailed situations. 
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1 Introduction 

1.1 Notation 

We denote by T > 0 a fixed time horizon, by R™ the 
m-dimensional Euclidean space, by | • | (resp. (•,•)) the 
norm (resp. scalar product) in a Euclidean space, by 
(resp. A~^) the transposition (resp. reverse) of A, by 
S'™ the set of symmetric m x m matrices with real el¬ 
ements, by fx the partial derivative of / with respect 
to X, and by C a positive constant, which can be dif¬ 
ferent from line to line. Let (,^t)o<t<T,P) be a 

complete filtered probability space, on which are given 
an ,^t-adapted standard Brownian motion (wt, Wt) with 
values in and a Gaussian random variable ^ with 
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mean /ig and covariance matrix cto- (wjw) is indepen¬ 
dent of If A G S'™ is positive (semi) definite, we write 
A > (>)0. If X : [0,T] ^ R™ is uniformly bounded, 
we write x G (0, T ; R™). If a; : fl —>■ R™ is an ^t- 
measurable, square-integrable random variable, we write 
X G AfJ-(R™). If a: : [0, T] x fl —^ R™ is an ,^t-adapted, 
square-integrable process, we write x G jSfJr(0, T; R™). 
We also adopt similar notations for other processes, Eu¬ 
clidean spaces and filtrations. 

1.2 Motivation 

Now consider an asset-liability management problem of 
a firm. Let the dimension n = k = r = r = l. Denote 
by E the expectation with respect to P, by vt the con- 
trol_strategy of the firm, by x1 the cash-balance, and 
by I'l the liability process. Norberg [35] described the 
liability process by a Brownian motion with drift. The 
model, however, is not just the one we want. In fact, it 
is possible that the control strategy and the mean of the 
cash-balance can influence the liability process, due to 
the complexity of the financial market and the risk aver¬ 
sion behavior of the firm. Such an example can be found 
in Huang et al. [22], where the liability process depends 
on a control strategy (for example, capital injection or 
withdrawal) of the firm. Along this line, we proceed to 
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improve the liability process here. Suppose that sat¬ 
isfies a linear stochastic differential equation (SDE, in 
short) of the form 

-dl'l = (dtEcCt -I- bfVt + ht)dt + Ctdwt- 

Here d, b, b, c, a,_f, g and h are deterministic and uni¬ 
formly bounded, bt and Ct denote the liability rate and 
the volatility coefficient, respectively. Suppose that the 
firm owns an initial investment and only invests in a 
money account with the compounded interest rate a*. 
Then the cash-balance of the firm is 

y f* asds {y f — ["ardr jTy 

Xt = eJo ® 

It follows from Ito’s formula that 

J dx^ = {atXf + dtEXf + btVt + bt) dt -I- Ctdwt, 

I = e 

Note that, if bt = 1, bt = 0, at = —dt = const, and 
Ct = const., then the cash-balance equation is just the 
systematic risk model of inter-bank borrowing and lend¬ 
ing introduced in Carmona et al. [11]. Besides the sys¬ 
tematic risk model, the equation can also be reduced 
to an air conditioning control model in energy-efficient 
buildings. See, e.g., Example 2 in Djehiche et al. [12] for 
more details. 


Due to the discreteness of account information, it is pos¬ 
sible for the firm to partially observe the cash-balance 
by the stock price 


( 


dS^ = 

So = 1- 



Set Ej" = log St ■ It holds that E’' is governed by 


f dEj" = {ftXt + gt) dt + htdwt, 

I V = o. 


Suppose that the firm has triple performance objectives. 
The first two ones are to minimize the total cost of v 
over [0, T] and to minimize the risk of Assume that 
the risk is measured by E [(x^ — Ex^)^] . The third one 
is to maximize the utility resulting from v. Without 
loss of generality, define 


2/r =E 


Xt+ G{s,y'^,Vs)ds 




instantaneous control Vt, but also on the future utility 
and control {y^jVs), t < s < T. This shows the differ¬ 
ence between the utility y^ and the standard additive 
utility, and hence, y^ is called as a stochastic differen¬ 
tial recursive utility in Duffie and Epstein [13]. Then the 
asset-liability management problem with recursive util¬ 
ity is stated as follows. 


Problem (AL). Find a cr{Y^; 0 < s < t}-adapted and 
square-integrable process Vt such that 

f Btv'idt + H{x^t - 
0 

is minimized. Here B > 0 and B~^ are deterministic 
and uniformly bounded. H and N are non-negative con¬ 
stants. j/q is the value of y^ at time 0. 



Let us now turn to the recursive utility again. Ac¬ 
cording to El Karoui et al. [14], y^ admits the backward 
stochastic differential equation (BSDE, in short) 

f -dVt = Git, yt,vt)dt - z^dwt - z^dwt, 

\ yt^ = x^. 


With the BSDE, Problem (AL) can be rewritten as an 
optimal control problem derived by forward-backward 
stochastic differential equation (FBSDE, in short) with 
noisy observation. It is possible to work out one more 
asset-liability management problem. We omit the details 
to limit the length of this article. 


1.3 Problem statement 


Motivated by the examples, we study an optimal control 
problem for FBSDE with noisy observation. Consider a 
controlled FBSDE 

dxt = (atXt + dtExj -I- 6(t, Vt)) dt + Ctdwt, 

-dy^ = (atx'l + dtExj -b Pty'l + PtEyl + jtZt 
< -b ^t^z"t -b -b it^Zt + tpit, vt))dt 

— Zt dwt — z\ dwt ^ 

. = y^ = px^ + pEx^, 

where (x"",y^, z"", S'") is the sate, v is the control, and 
(w, w) is the Brownian motion. Since the mean of the 
state influences the state equation, we call the equation 
a mean-field FBSDE, or a McKean-Vlasov FBSDE. As¬ 
sume that {x",y", z",z") is partially observed through 

r dYf" = (ftXf + ftTExt + git, Vt)) dt -b htdwt, 

I iV = o. 


where G is Lipschitiz continuous with respect to iy,v), 
and G(s, 0,0) € T; R) for 0 < s < T. We empha¬ 

size that the current utility y)) depends not only on the 


2 



The cost functional is 


J[v\ = E 






(j>{xyp,'Ex'"j.) + Lp{yl) 


Here Vt is required to be a{Y^;0 < s < t}-adapted and 
to satisfy Esupq<(< 7 ^ < +oo. a, a, b, c, a, a, f3, /3, 
1, 7> 1,1, V”, P, P, f, L 9, h, I, (j) and (f will be specified in 
Section 2. Our problem is to select an admissible control 
V to minimize J[u]. We denote the problem by Problem 
(MFC), where “MF” and “C” are the capital initials of 
“mean-field” and “control”, respectively. 


mean-field type control is also partially inspired by the 
interest in mean-field game. If we only focus on a sin¬ 
gle decision maker, also called a representative agent, 
mean-field game can be regarded as mean-field type con¬ 
trol. Generally speaking, an exact Nash equilibrium for 
mean-filed game with a large nnmber of decision mak¬ 
ers is rarely available except for special cases (see, e.g., 
Carmona et al. [11]). It is highly desirable to find a good 
approximation of this Nash equilibrium. Please refer to 
Carmona et al. [10], Tembine et al. [38], Bensoussan et 
al. [4], etc. for more details on different types of approx¬ 
imation equilibrium. See also Bensoussan et al. [6] for a 
comprehensive study of a general LQ mean field game. 


To solve Problem (MFC), it is natural to use dynamic 
programming and maximum principle. The dynamic 
programming, however, does not hold even if the BSDE 
and the observation equation are not present, mainly 
due to the inclusion of the mean of the state, which 
leads to the time inconsistency. We instead study the 
maximum principle for optimality of Problem (MFC). 

1-4 Briefly historical retrospect and contribution of this 
paper 

Mean-field theory provides an effective tool for inves¬ 
tigating the collective behaviors arising from individu¬ 
als’ mutual interactions in various different fields, say, 
finance, game, engineering. Since the independent intro¬ 
duction by Lasry and Lions [27] and Huang et al. [24,25], 
the mean-field theory has attracted more and more at¬ 
tention. Let us now briefly recall some latest develop¬ 
ments which are related to Problem (MFC). 

Although the study of mean-field SDE has a long history 
with the pioneering works of Kac [26] and McKean [31], 
mean-field type control is a rather new research direc¬ 
tion. In 2001, Ahmed and Ding [1] used the Nisio nonlin¬ 
ear operator semigroup to obtain an extended dynamic 
programming. By dual techniques, maximum principles 
for several mean-field SDEs with fnll information were 
derived. See, e.g., Buckdahn et al. [7], Li [28], Hafayed 
and Abbas [16], Shen et al. [37], Djehiche et al. [12]. 
Subsequently, Meyer-Brandis et al. [32], Hafayed et al. 
[17,18] studied the partial information case, where noisy 
observation and filtering are excluded. As applications 
of the derived maximum principles, [32,41,20] solved 
mean-variance problems with full and partial informa¬ 
tion. Yong [43] studied a linear-quadratic (LQ, in short) 
optimal control problem for mean-field SDE with full 
information. Further, Yong [44] investigated the time- 
inconsistency feature of the LQ problem, and obtained 
both open-loop and closed-loop equilibrium solutions. 
Later, Huang et al. [21] extended the LQ problem to the 
case of infinite horizon. For the discrete-time counter¬ 
part of the LQ problem, please refer to Elliott et al. [15], 
Ni et al. [33,34] and the references therein for more de¬ 
tails. It is worth pointing out that the investigation of 


Both mean-field type control and mean-field game lead 
to mean-field FBSDE. Buckdahn et al. [8] studied the 
well-posedness of a decoupled mean-field FBSDE using 
a limit approach. Bensoussan et al. [5], Carmona and 
Delarue [9] extended [8] to the fnlly coupled mean-field 
FBSDE case in terms of a continuation method intro¬ 
duced in Peng and Wu [36]. Mean-field FBSDE is a well- 
defined dynamic system, it is very natural and appeal¬ 
ing to study control and game problems for mean-field 
FBSDEs as well as their applications. To our knowledge, 
there is only a few literature on this topic. For example, 
Li and Liu [29] studied an optimal control problem for 
fully coupled mean-field FBSDE. Hafayed et al. [19] ob¬ 
tained a maximum principle for mean-field FBSDE with 
jump. Huang et al. [23] studied an LQ game with a lin¬ 
ear mean-field BSDE system and a quadratic cost func¬ 
tional. [19,23] also provided some applications in mean- 
variance and recursive utility problems. 

In this paper, we are interested in studying Problem 
(MFC). Compared with the above literature, this prob¬ 
lem has several new features as follows. 

• The state {x"", y"", z"", S'") satisfies a mean-field FBSDE 
rather than a mean-field SDE, and is only partially 
observed by a noisy process. This endows Problem 
(MFC) more practical meanings in reality. 

• Unlike those control models solved in Bensoussan [3], 
the classical separation principle does not work here, 
mainly due to the fact that the mean square error of 
filtering of BSDE depends on the control in general. 

• The state equation involves the mean of the state, and 
thus. Problem (MFC) can not be studied by trans¬ 
forming it into a standard control problem for FB¬ 
SDE. This feature can be supported by Example 2.2 
in this paper. 

There is a few papers related to Problem (MFC). Let 
us make a brief comment on them. Wang et al. [42] 
posted a partially observable mean-field type optimal 
control problem for SDE. They used a backward sepa¬ 
ration method and a probability transformation to de¬ 
couple a circular dependence between the control and 
the observation first, and then derived a necessary con¬ 
dition for optimality. The result was further generalized 
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in Wang et al. [41] by the backward separation method 
with an approximation technique. Later, Hu et al. [20] 
studied an optimal control problem for mean-field SDE 
with jump. Zhang [45] addressed the case with corre¬ 
lated state and observation noises. We emphasize that 
the approach applied in [20,41,42,45] is based on at least 
one of the assumptions below. 


• The state satisfies an SDE rather than an EBSDE. 

• The drift term of the observation equation is uniformly 
bounded with respect to the state and the control. 

• The control has no effect on the observation. 

• The controls satisfies Esupo<i< 7 ’ \vt\^ < -foo, > 0. 


Clearly, Problem (MEC) does not meet these assump¬ 
tions. Another approach is desired to develop to address 
Problem (MFC). In [40], Wang et al. studied an LQ con¬ 
trol problem for classical EBSDE (i.e., the dynamics of 
the EBSDE does not depend on the probability distri¬ 
bution of the state). Inspired by Bensoussan [2], they 
solved the LQ problem by combining a decomposition 
technique with the backward separation method. Re¬ 
cently, our further study on the approach provided in 
Wang et al. [40] finds out the availability of the approach 
to some nonlinear control problems with noisy observa¬ 
tions, say, Problem (MFC). In this paper, we will show 
how to use the approach to address Problem (MFC). 
See also Wang et al. [39] for other developments about 
partially observable optimal control for EBSDE. 


The rest of this article is organized as follows. In Section 
2, we carefully formulate Problem (MFC) first, and then 
provide illustrative examples and preliminary results. In 
Section 3, we obtain two optimality conditions and two 
coupled forward-backward optimal filtering equations. 
In Section 4, we study an LQ case of Problem (MFC) 
and obtain a feedback representation of optimal control. 
In Section 5, we explicitly solve an asset-liability man¬ 
agement problem with noisy observation, and work out 
an illustrative numerical example. Some concluding re¬ 
marks and proofs of the preliminary results are given in 
Section 6 and Appendix, respectively. 


2 Problem formulation and preliminary 


One difficulty to study Problem (MFC) is there is a cir¬ 
cular dependence between the control v and the obser¬ 
vation Y'’, which results in the unavailability of classical 
variation. Here we will adopt a decomposition technique, 
similar to those of [2,40], to overcome the difficulty. De¬ 


fine (a:°, 2 °, zP) and by 


dXf = {atx^ + UtEa;®) dt + Cfdwt, 
-dy° = {atx° + dtEa;° -f fdtVt + Pt^Vt 
+ jtSt + dt 

— dwt — Zf dwt , 


■ ItZt 


^0 ~ 


V 0 

yj, = pxj, 






and 


dY° = {ftx° + /tEa;°) dt + htdwt, 


= 0, 


( 1 ) 

( 2 ) 


where a, d € J§f“(0, T; IR”''))), c G .5^“(0, T; IR"^’’), 
a, d G ^“(0,r;lR’”''"), /3, /3 G J§f“(0,T; IR'"''™), /, 
/ G h,h-^ G j = 

{jl,--- ,Jr), 7 = (7i.--- ,7r), 7 = (7i.--- ,7r), 7 = 


(71G-- ,7f). z° = {i 


1 5 


?), 5° = (5?,' 


^r), 7i, 


7i> 7i) 7i ^ ^°°(0, T ; IR™ '") for j = 1, • • • , r or f, and 
p, p & IR'"^" are constant matrices. Here we use the 
simplified notation jtZt — Similarly, it is 

also applicable for the notations 7(5°, 7 iEz°, 7 tEz°, • • •. 


V G .ifJ-(0,T;lR^) be a control process. Define 
and Y'’^ by 


• V,1 _ V,1 


= atx^’^ + atEXi’^ -I- b{t, vt), 


= (' 


atx^' - 

V,1 

■ ItZt 


atEx^’^ - 
-7tE<’' 




~L?,1 


■ ItZt 


■ /3tEy, 

■ 7*^5, 


V,1 


V,1 


+'tjj{t, Vt)) dt - Zt’^dwt - Zt'^dwt, 


_ V,1 


xo'^=^, yT = + 


p Ex 


1?,1 
T ’ 


and 


-f g{t,vt), 

= 0 , 


( 3 ) 

( 4 ) 


where g : [0, T] x —)■ IC satisfies E \g{t, vt)\‘^dt < 

-boo, b : [0, T] X 1R'= ^ IR" and f : [0, T] x 1R'= ^ R”" are 
continuous and continuously differentiable with respect 
to t and V, respectively, by G jSf°°(0, T; IR"^^) and -tpy G 
^ 00 ( 0 , y; IRfnxfc)^ Since (1) and (3) are decoupled, it is 
easy to see from Buckdahn et al. [8] that (1), (2), (3) and 
(4) have unique solutions, respectively. Define 


V 0 I "^^,1 V 0 I 4;,1 

Xt =Yl+XtY yt =y'^ +yY , 


V,1 


-L?,l 


( 5 ) 


and 

= + ( 6 ) 

It follows from Ito’s formula that (x“, y'", z'’, z'’) and Y'’ 
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uniquely solve 

dx'l = {atx'l + dtVjxl + b{t, vt)) dt + Ctdwt, 

-dVt = {atXt + OtExj + Ptyt + + ItZt 

< +7tE2:t + ItZt + 7tE5t + -tpit, Vt)) dt 

— Zt dwt — Zt dwt, 

(7) 

and 

r dYi’ = [ftXt + /(Ext + g{t, Vt)) dt + htdwt, , . 

{ y„” = o. <*> 

respectively. Let = a{Y^;0 < s < t} and = 
CT{yg°;0 < s < t}. Note that the first observable filtra¬ 
tion depends on the control v. However, the second one 
is not the case. We now give a definition of admissible 
control. Let 17 be a nonempty convex subset of E*, and 
let be the collection of all -adapted processes 
with values in U such that EsupQ<j< 7 n \vt\^ < -l-oo. 

Definition 2.1. A control v is called admissible, if u S 
^ad -adapted. The set of all admissible controls 

is denoted by ^ad- 


Problem (MFC). Find a u G ‘^ad such that J[u] = 
inft,g< 2 /^^ J[v] subject to (7), (8) and (9). Any u satisfy¬ 
ing the equality is called an optimal control of Problem 
(MFC), and (a;“, y“, 7“) and J[u] are called the opti¬ 

mal state and the optimal cost functional, respectively. 

Note that the above decomposition technique is re¬ 
stricted to special structures of state and observation 
equations, say, the case that (7) and (8) are linear with 
respect to (x", y’', z". S'"), the diffusion coefficient of (7) 
is deterministic, and the drift coefficient of (8) is in¬ 
dependent of {y'", z", S'"). It is worth investigating the 
availability of the technique to decompose more general 
state and observation equations in the future. 

Next, let us show more new features of Problem (MFC) 
by two simple examples. Roughly speaking. Example 
2.1 tells us that Problem (MFC) is possibly applied to 
solve a partially observable optimal control problem for 
mean-field SDE with stochastic coefficients in certain 
situations; Example 2.2 reveals that Problem (MFC) is 
not a trivial extension to a partially observable optimal 
control problem for FBSDE without mean-field term. 

Example 2.1. Let at = at = fdt = Pt = = 0 

Problem (MFC). Then (7) is reduced to 


With the definition, it follows from the equality (6) that 


Proposition 2.1. For any v G '^ad, S^'t" = ■ 

It implies that the control v has no effect on the obser¬ 
vation Y", i.e., the circular dependence between v and 
Y" is decoupled. 


The cost functional is in the form of 


J[v] = E 


l{t, Xt, ^Xt,vt)dt 


(9) 


+(j){x"rp,'Ex"j.) + ipiy'"^)], 


{ dx'l = {atx^ + OtExj -I- h(t, Vt))dt Y Ctdwt, 

~dyt = {'-ftZt + ltz"t Y ■i/'(C vt))dt - Ztdwt - Stdwt, 

x" =^, y"j.= px"x Y pEx"x. 

( 10 ) 

Solving the BSDE in (10), we get 


J/S = E 


{riT,pxT + pExx) 


{Vt,'ipit,vt))dt 


( 11 ) 


with 

r dpt = ItPtdwt Y ytVtdwt, 

\ ^0 ~ djji , 

where Im is an m-dimensional vector with all compo¬ 
nents being 1. Plugging (11) into (9), we have 


where I : [0, T] x E”+” x U ^ U, (j) : E”+” ^ E 
and cp : E™ —> E are continuously differentiable with 
respect to (x,x,v), (x,x) and y, respectively, and there 
is a constant C > 0 such that 

lc/>(x,x)i<C(lYlxi^Ylxn, 

\4>x{x,x)\ Y \(j)xix,x)\ < (7(1 -I- \x\ Y |x|), 

\l{t,x,x,v)\ < (7(1 -I- Ixp -I- |xp -I- |?;p), 
\lxit,x,x,v)\ < (7(1 -b \x\ Y |x| -b |?^|), 
Hy)\<CilY\yn 

\Yy{y)\<C{lY\y\^) 

with X = x,x, V. Then the optimal control problem for 
mean-field FBSDE is restated as follows. 


J[v] = E 


l{t, x^,'Eixl,vt)dt Y (j){x'^, Exj 


-b (/? (E [{pT, + pExy) 

+ J {Vt,tpit,vt))dt . 


Then Problem (MFC) is reduced to minimize J\v\ over 
‘^ad subject to (8) and the SDE in (10). It is worth not¬ 
ing that we start with a control model with determin¬ 
istic coefficients, but we end up with a control model 
with stochastic coefficients. The interesting phenomena 
is caused by the introduction of the BSDE in (10). Just 
because of this, maybe it provides a potential method to 
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investigate a control problem for mean-field SDE with 
stochastic coefficients under certain conditions, i.e., we 
can change it into an equivalent control problem for 
mean-field FBSDE with deterministic coefficients. The 
details of how to make use of this potential method will 
be shown in our future publications, because they be¬ 
yond the scope of the present paper. 


r d{Y,^ - EY-) = [Mx^t - Exl) + gtivt - En*)] dt 
< +htdwt, 

[ Y- - EYJ = 0, 
respectively. Let 


/ ^ - Mo \ f Vt-Evt 

V Mo / V 


Example 2.2. Eind a v € ‘^ad such that 

JH = E [ + 

Jo 

E{BtVt, Vt)) dt -f {Hx^, x^) 

+ {HExl^,Exl^) + {Myly-)] 

is minimized, subject to (7) and (8) with the assumption 

b{t, Vt) = btvt + bt, ipit, Vt) = -^tvt + git, Vt) = gtvt + 
gt, where A,A e .5f“(0, T; S'”), B € T; S'=), 

H,H G S^, M G S’”, A,B,H,M > 0, A + A,H + 
H > 0, b € Af°°iO,T;¥C^'^), b G if“(0,T; R”), 
^ G jSf“(0,r;R’”'"'=), ^ G .if“(0,r;R’”), g G 
.if“(0,r;R^^''') and g G T; R^). Eor simplicity, 

we denote the LQ problem by Example (MFLQ). 


Taking expectations on both sides of (7) and (8), we have 


Ext — —Ext — {at + at)Ext + btEvt + bt, 

-Ey” = - ^Ey,” = (at + at)^x- + iPt + A)Ey” 
+ ilt + 7i)Ezr + ilt + 7i)Ezr 
+ i}tEvt +i)t, 

Exq = Mo, Ey^ = [pG- p) Ext, 


yr - Eyr 


Yj” - EY” 


a,= “• V . b,= V» . 6 ,= “ 

\ 0 at + at J \ 0 bt I \bt 


at 0 
0 at + at 


Pt 0 
0 Pt+Pt 


It 0 
0 7t -f 7t 


= V), = 

V 0 7i + 7i / V 0 V’i / \i’t 


0 p + p 


ft 0 
0 ft + ft 


EY- = -EY” = (ft + ft)Ex)^ + gtEvt + gt, 

EYo” = 0, 


respectively. Then 

dixt—Ext)= at{xt — Ext) + btivt — Evt) dt 

+ ctdwt, 

-diyl - Ey”) = [atix'l - Exf) + Wt - ^vl) 

Y^tiz'l-Ez'D+Uit-^-^) 

I +fi)tivt-Evt^ dt 


— zl dwt — Zf dwt, 
Xq - Ea;o = ^ - p,o, 

y" - Ey^ = p{x^T - Ex^), 


dx” = (atx” -I- btV( -I- bt) dt + Ctdwt, 
-djt = {at^t + PtYt + ftZt + JtZt + iptvt 


dt — cZj dwt — cZt dwt, 


Xq = xo, yr = pyAp, 


dYj = (fiX)" -I- gt)dt + htdwt. 


YS = 0. 
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On the other hand, let 


3.1 Optimality conditions 


(A, 0_\ 

y 0 At + At J 



VO H + H ) 


M = 


/m 0 \ 

\ 0 M ) ' 


By simple calculations, we get 


According to Theorem 2.1 above, the optimality condi¬ 
tions can be derived by minimizing J[u] over sub¬ 
ject to (7) and (8). We remind reader again that these 
results are different from the existing literature, mainly 
due to some new features of Problem (MFC). For exam¬ 
ple, the state is governed by a mean-field FBSDE, and 
is partially observed via a noisy process. 


+ (AtEx^Ex")} = E(Atx)’,x)’), 
E{(i7x^,x^) -f (i7Ex^,Ex^)} = E(Hx^,x^), 
E(Mj/S,j/S) = E(My(;,y(;), E{BtVt,Vt) = E(B,Vt,v,). 


Then the cost functional is rewritten as 


J[v] = E 


((Atx)’,x)')-f (Btvt,vt))dt 

0 

(Hx^,x?,) + (My(;,yS)]. 


(14) 


Note that (14) together with (12) and (13) forms a 
standard-looking LQ problem for FBSDE with noisy ob¬ 
servation. However, the BSDE in (12) is not a standard 
form due to the irreversibility of c. Moreover, the control 
domain has to satisfy some extra constraint conditions 
according to the form of the control v. This implies that 
Example (MFLQ) cannot be reduced to a standard LQ 
problem for FBSDE, hence it cannot be immediately 
solved by the standard LQ theory for FBSDE. 


Theorem 3.1. Assume thatu is an optimal control for 
Problem (MFC). Then the mean-field FBSDE 

dkt = {Pjkt Pj Efct) dt (jJkt -\- pj Efc*) dwt 

-j- (pj kt + pj Efct) dwt, 

^ -~dpt= [aJpt + l]:{e^)+E{aJpt + lJ{e^))-aJkt 
-aj Efct] dt - qtdwt - ijtdwt, 
ko = -TyiVo)^ 

^ PT = Pj{^lf)+TFPj{Elf)-p^kT-p^’EkT 

(15) 

has a unique solution {k,p,q,q) in the space Af|r(0,T; 
^^+n+nxr+nxf-^ SUc/l that 


E 


Hy{t,\T(,uukt,Puqt){v - Ut) 


> 0 (16) 


for any v € U, where the Hamiltonian function H is 
defined by 


In the end of this section, we give a preliminary result, 
which shows that the desired optimality condition can 
be derived by minimizing J[x] over 


Theorem 2.1. 


inf .J[v'] 


inf 


J[v]. 


The proof can be found in Appendix. 


□ 


3 Optimal solution of Problem (MFC) 

For any v,Vj G ‘Wad, let (x", yA 5Q and , 

z"-*) be the solutions of (7) corresponding to v and Vj, 
j = 1, 2, • • •. For simplicity, we set 

(0^) = (t,x^,Ex^,At), {E() = (x^,Ex^), 

(n^) =ixt ,yt,Zt,Zt, Ex(, Ej/j, E^j, Ez^) 

with X = v,u,Vj, j = 1, 2, • • •. 


H{t, X, y, z, z, X, y, z, z, v; k,p, q) 

= {atx + dtx + b(t,v),p) + {ct, q) + l{t,x,x,v) 

- (atx -I- atx + pty + Pty + Itz F ptz F ptz 

Fptz F'if{t,v),k) . 


Proof. If u is an optimal control for Problem (MFC), 
Theorem 2.1 implies that J[u] = inf„g.^o T[x]. For any 

V G let f“+^^)“be the solution 

of (7) corresponding to u -I- £x, 0 < e < 1. Introduce a 
variational equation 

xf t = atxft + dt^xf t F by{t, Ut)vt, 

-dyl, = {atxlt F at^xlt F Ptvlt + 'Pt^vlt 
< + + 7iIEz“t -f PtZi^t + 

Ffvft, ut)vt) dt - zftdwt - zftdwt, 

- ^1,0 = Oj Vi.T = pxfr + P Ex“ r, 

which admits a unique solution (x“, ?/“, z“, z“) G 
T; ]r"+-+-x-+™xq. It follows from Taylor’s ex¬ 
pansion, Holder’s inequality and the techniques applied 
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in Lemma A.l (see Appendix) that 


(k,p,q,q) € .Sf|r(0, T; is the solution 

to (15). Thenu is an optimal control of Problem (MFC). 


lim E sup ' * 

o<t<T 


u+ev u 


Xt — X 


= 0 , 


and then, 


lim E sup 

=-*■0 0 <t<T 


- yr 


ypt 


= 0 . 


Combining the limits with the optimality of u using the 
first variation of J[u], we have 


0 < lim 

s —>^0 


J[u + ev] — J[u] 


= E / (ij iei)xi, + ij{emxit + C iQt)vt)dt 


+E(0j (S“ )x“ ^ (S“ )Ex“ ^ iy-)yl,). 


(17) 


On the other hand, once (x“, j/“, z“, z“) is determined 
by (7), there is a unique solution {k,p, q, q), in the space 
.if|r(0, T; to (15). Using Ito’s formula 

to {xf,p) + {yf, k) and inserting it into (17), we get 


Jo 

Due to M G and the arbitrariness of Vt, we deduce 


E 
= E 


{pjby{t,ut) - kjipy{t,ut) + iji&f)) iiy-ut)\.^t 

Hy{t,Uf,Ut;kt,pt,qt)ii^ - ■Wt)|=^t^° 


> 0 , 


for any n gU. Since u G '^ad, it follows from Proposition 
2.1 that ■ Then the result is derived. The 

proof is complete. □ 

Theorem 3.2. Assume that for any {t, x,y, z, z, x, y, z, z, 
v) G [0 T] X X U 

(x,x,v) 1 -^ l{t,x,x,v), (x,x) I—>• 4}{x,x) and y i-G 
p(y) are convex. Assume that b{t,Vt) = bfVt + 
bt,ip{t,vt) = -ipiVt + where b G .if°°(0, T; 1R”_^''), 
b G Jf°°(0,T;lR”), if G if“(0,T;lR’"^'=), ff G 
Sf°°{0, T ; IR'"). Let u G 'Wad and 


E 


H{t,U^,Ut;kt,pt,qt) 


= inf E 

veu 


H{t,Uf,v;kt,pt,qt) 


(18) 


(0 2^- 

the solution to (7) under the admissible control u, and 


where (a;“, y“, z“, 5“) G Af^{0,T;WC - .) ^s 


Proof. For any v G 'Wad, we write 

J[v] - JM =h+l 2 + h 

with 


(19) 


/i = Ey (1(0)1) - l{ef)) dt, /2 = E [</)(S^) - </)(S^)], 

/3 = e[^°(j/(;)-^(j/“)]. 

By virtue of the convexity of 4> and applying Ito’s formula 
to {p, x"" — x^), we have 

/a > E {pT, xVp - x^) + E {p^kr + E(p^fcT), xlf - xlf) 

= -E r {Ij (0“) + E Ij(0“) -ajkt- E(d7 kt), 

Jo 

x'') - xf) dt + E, [ (pt,bt{vt - ut)) dt 


( 20 ) 


+E{p^kT + E{p^kT),xlf-xlf). 


Similarly, applying Ito’s formula to {k, y'’ — y“) and the 
convexity of p, we derive 

h > -E{kT, p{xlf - xlf) + pE(x^ - xlf)) 


-E 


[ {kt,atixt - xf) + atE{xt - Xt) 

Jo 


+'4’tivt - Ut)) dt. 


( 21 ) 


E 

4 *= 


It is easy to see from (19), (20), (21) and the convexity 
of I that 

J[v] — J[t6] 

>E / Hy{t,Uf,Ut;kt,pt,qt){vt-Ut)dt 
Jo 

pT ^ 

= e/ E Ha{t,ILf,ut]kt,pt,qt){vt-Ut)\^l° dt. 
Jo 

Fmthei, using Theorem 2.1 and (18), we get 

Hy(t,Ht,ut;kt,pt,qt){vt - 

H{t,IFf,ut]kt,pt,qt)\-^l" {vt - Ut) > 0. 
Then this implies the desired result. □ 

3.2 Optimal filters 

The minimum condition (16) shows that we need to 
analyze the optimal filters of (7) and (15) depend- 


t 



ing on in order to compute u. For this, for 

any v € ‘^ad we denote by ft = E ] with 

it = x^,x^,y^,z:^,z^,y^ix^y and kt = E 
with Kt = kt,pt,Ptixty,ktixtV,qt the optimal fil¬ 
ters of Lt and Kt, respectively. Moreover, we denote by 
St = E [{xt — Xt){xt — it )^] the mean square error of 
it, V e ^ad- 


Qt = ht + {ptix'^y -pt{xfy^ fj {ht ^y. 


We emphasize that (22) and (23) are two forward- 
backward optimal filters. It shows that the difference 
between Theorem 3.3 and the classical filtering litera¬ 
ture, say, Bensoussan [2,3], Liptser and Shiryayev [30]. 


Using Theorem 12.7 in Liptser and Shiryayev [30] and 
Theorem 3.1 in Wang et al. [42], we derive (22) and (23), 
respectively. 


Theorem 3.3. For any v G ‘^ad, the optimal filters 
(xt,yt) and {kt,Pt) of the solutions {xt,yt) and {kt,pt) 
to (7) and (15) with respect to and satisfy 


dxf = {atx( -I- OtExt -I- b(t, Vt)) dt 

+ St/t^ {ht ^ydwt, 

-dyf = {aty + dtExt -b Pty( + fit^y( + Itif 
Ffit^zf -b fitzf + 7tEit F'f{t,vt))dt 


- Zywt, 

Xq ~ kO: Vt ~ pXT -b pExy, 


and 


( 22 ) 


dkt = (^Pjkt + Pj Efct) dt + jJkt + pj Efct 


-dpt = jot pt + E 


(kt{xfy - kiyy) / 7 (ht 


dwt, 




yt, 


— aJ kt — aJ lE/pt 


+E (a^Pt + y (OD) }dt- Qtdwt, 


ko= - PyiVo), 


py = E 


^T(s^) +E</>i (s^) 


— p' kx — p^Efcy, 


respectively, where S is the unique solution of 

±t - atZt - ^taj + Et/7 {hpYhyfyt 

- ctcj = 0, 

y = CTo, 


(23) 


(24) 


Wt = f hy [dy° - (/.i° + /.Ex°) ds] (25) 

is a standard Brownian motion with value in and 

Zt = zt + (yfix^y - yyyy^ fyhpk^, 


4 An LQ case of Problem (MFC) 


We still adopt the notations and the assumptions intro¬ 
duced in Sections 2 and 3 unless noted otherwise. 


Problem (MFLQ). Minimize 

iH = [{Atx^t.xO + (AtEx^Exn 

-b {Btvt,vt) -b 2{Dtx(,vt) + 2{DtE,x(,vt) 
+2{Ft,x() +2{Ft,t]x() +2{Gt,Vt)] dt 
+ {HxP, xP) -b (iiEx^, Ex^) -b 2{L, x^) 
+2{L,I5xy + {Myl,yl) + 2{N,yl)] (26) 


over '^ad with the control domain U = R^, subject to 
the state equation 

( dx( = {atx( -b atExt -b btVt + bf) dt + Ctdwt, 
-dyf = {atx( + dtExt -b Ptyf + ^tEyJ' -b -ftzf 
-b ptzf -b pt^zf + iftVt + y) dt 
— zfdwt — zfdwt, 
xl=e ylf = pxp + p^xlf, 

(27) 

and the observation equation 


dYfi = [ftx( + /tExJ" -b gt) dt + htdwt, 

YJ = 0 , 


(28) 


where A, A e J§f“(0, T; S'”), B € ^°°(0, T; S'^), 

H, H G S^, A, H, M > 0, B > 0, A + A, ff + ff > 0, 
D,D G J§f“(0,r;lR'=^”), F,F G J§f“(0, T; R”), 

G G FP°°{f),T-,ny, L,L G R",A G R””, b G 
.^“(0, T; R”''''^), b G T; R”), i) G T; R'”^''), 

([> G Af°°{0,T-,Yr), and g G ^^{0,T;Rk- 

Note that we do not assume the positive semidehniteness 
of A and H in Problem (MFLQ). Then (26) covers the 
performance functional of Problem (AL) as a special 
case. 

Proposition 4.1. Ifu is an optimal control for Problem 
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(MFLQ), then 

ut= - B-^ (bjE [pti^r] - [kt\-^r 

+AE [x'l\^r] + + Gt) , 

where {k,p) is the solution of the adjoint equation 


tively. By Theorems 3.1, 3.2 and 3.3, an optimal feed¬ 
back control is explicitly obtained. The procedure of how 
to solve is decomposed into five steps below. Note that 
such an optimal control will play a role in Problem (AL). 
Please refer to Section 5 below for more details. 

Step 1: A reduced LQ problem. 


dkt = {Pjkt + Pj Efct) dt + {jJkt + pj Efct) dwt 
+ (77 kt + 77Efct) dwt, 

-dpt = [ajpt - ajkt + Atxf + DJut + Ft 

+E {dt Pt ~ del kt + Atxf -\- DJ Ut + A)] dt 
- qtdwt - qtdwt, 
ko= - My^ - N, 

Pt = FIxt + H^Xt + L + L — p~^ kr — p^Ekr, 

(29) 

together with the state equation 

dxf = {otxf + dt^xf + btUt + bt) dt + Ctdwt, 
-dyf = {atxf + dtExf + Ptylf + Pt^y^ + itz'f 
+7iEz“ -l- ptZt + 7iEz“ -l- p’t'Ut + 7i) dt 
— zfdwt — Ztdwt, 
a;“ = ^, y^ = pxlf + plFxlf. 

(30) 


Proof. With the above data, the Hamiltonian function 
is 


Integrating and taking expectations on both sides of the 
BSDE in (27), we have 



with 

S f"(dT+d-r)dT rp 

Xi = , t < s < 1. 

Plugging the equality into (26), we derive an LQ problem 
for mean-field SDE as follows. 


Problem (MFLQ)'. Find a V £ “Wad to minimize 

J[v] = [{Atx'i^x'i) + (A,Ex)l,Ex)l) 

-I- {BtVt: Vt) F 2{DtXt,vt) -I- 2{DtF^Xt,vt) 
+2{Ft,x'l)+2{Fu^x'j) + 2{Gt,Vt)] dt 
+ {Hxlf, xlf) + (HEa;^, TFxlf) + 2{L, xlf) 

-|-2(Z', } -l- t/o 


H{t, X, y, z, z, X, y, z, z, v, k, p, q) 

= {atx + dtx + btv + bt,p) + {ct, q) - {utx + dtx + Pty 
+Pty + ^tz + ptz + ptz + ptz + tftv + -4>t,k) 

+ 2 [(Aa;, x) + {Atx, x) + {Btv, v) + 2{Dtx, v) 

+2{DtX, v) + 2{Ft, x) F 2{Ft, x) F 2{Gt, v) , 

where {k,p,q,q) is determined by (29) together with 
(30). If u{) is optimal, it follows from Theorem 3.1 that 

ut= - Bf^ (57E [pt\^r] - A^E [kt\:^r 

+AE [:rri^ 7 “] + DtlExf F Gt) , 

where {x'^,k,p) is the solution of (30) with (29). Then 
the proof is complete. □ 

Note that one more explicit optimal control u strong de¬ 
pends on a certain special structure of the state equa¬ 
tion and the cost functional. Next, let us consider a par¬ 
ticular case of Problern (MFLQ), i.e., let M = 0 and 
/3i = 7t = 7t = 7i = Ti = 0 in (26) and (27), respec- 


subject to 

f dxt = (atXt -I- OtEx)’ -I- btVt F h) dt F Ctdwt, 

I = ? 

and (28) with 

F = FF{xlatVN, F = # + (x^dt)^ A 

A = A + (Xo A = Z-I-(xo p)^A 

G = GF{xli’tVN, Jo= [ N^xl^tdt. 

Jo 

Step 2: Candidate optimal control. 

The Hamiltonian function is 

F[{t,x,x,v;p,q) = {atX F dtX F btV F bt,p) F {ct,q) 

+ ^[(Aa:,a;) -I- {Atx,x) F {Btv,v) 

F 2{DtX, v) F 2{Dtx, v) -|- 2{Ft,x) 
F 2{Ft, x) F 2{Gt, w)], 


10 



where (p, q) is determined by the Hamiltonian system 


chain rule for computing the derivative of (34), we have 


dx^ = [atx^ + OtEx” + btUt + h) dt + Ctdwt, 
-dpt = [ajpt + AtXt + DJ ut + Ft 
' +E (ajpt + Atx'l + dJU t + -Ft)] dt 

- qtdwt - qtdwt, 

x" = 5, PT = Hx^ + HEx^ + L + L. 

( 31 ) 

If u is optimal, then it follows from Theorem 3.1 or 
Proposition 4.1 that 


ut= - B, 


^t'E 






DfWj 




+FtExr + Gt) 


— — ^ (bjPt + DtX^ + -DtEa;“ + Gt) . (32) 


Ept = d'fEa;” + d'tExJ' + 4't 

= |$t + 4'f [at + at - btBt^ {Dt + Dt)] 

-^tbtB-^bJ<^>t} Ea;“ + Tt - ^tbtB-^'St 

+ ‘ht {bt — btBt ^Gt) ■ 


Comparing it with the second equation in (33), we de¬ 
duce a Ricatti equation 


< 


4>f + 4>t [at -I- at 

+ ["(at + at)"'' — 


- btB-^ {Dt + Dt)] 
{Dt + Dt)^ Bi^bJ 


— <PtbtBt ^bj^t + At + At 


- {Dt + Dt)^ B^^ {Dt + Dt) = 0, 
<^>t = H + H 


4>t 


(35) 


Step 3: Feedback representation of (32). 


Inserting (32) into (31) and taking expectations, we get 
a fully coupled forward-backward ordinary differential 
equation (ODE, in short) 


and an ODE 


< 


+ 


(at + at) 


{Dt + Dt)'^ Bi^hJ 


-^thB-^bJ] Tt + $t {ht - htB-^Gt) 

- {Dt + Dt)^ Bt^Gt + Ft + Ft = 0, 
= L -\- L. 


(36) 


' Ex“ 

Ept 


[ot + at — btB^ ^ {Dt -|- Dt)] Ex“ 

— btBt ^bj Ept — btBt ^Gt + bt, 

— [^t + ~ {Dt + Dt) Bt ^ (Dt + Dt) 




(at + ttf)^ — {Dt + Dt) Bt ^bj Ept 


+ (Dt + DYB^^Gt-Ft-Ft, 

.Ex^=/to, IFpT = (H + H)Ex^ + L + L. 

(33) 

According to Theorem 2.6 in Peng and Wu [36], (33) has 
a unique solution (Ex“, Ep) based on the assumption 


Clearly, (35) admits a unique solution, and thus, (36) 
also has a unique solution. Plugging (34) into the first 
equation of (33), we derive 

Ea;“ = [ot -f at — btBt ^ {Dt + Dt) 

— btB^ ^Gt H“ 

Ea^o = Mo, 
which can be explicitly computed. 


(Al). There is a constant G > 0 such that 


GF xn 


At -|- At — (Dt + Dt 


)^Bi^{Dt + Dt) 


< 0 . 


Hereinafter, Inxn stands for an n x n unit matrix. 


Noticing the terminal condition of (33), we set 

Ept = 4>tEx“ + Tt (34) 

for two deterministic and differentiable functions $ and 
T such that <I>t = F[ + H and Ty — L + L. Applying the 


Using Theorem 3.3 to (31) with (32), we get the optimal 
filtering equation 

dxt = [{at-btB(~^Dt)xt -btB(~^bJpt + 9i^t]dt 

+ ^tf7ihi^)^dwt, 

< -dpt = [{At - Dj Bt^Dt) + (a^ - Dj B^^bJ) pt 
+92,t] dt - Qtdwt, 

, Xq = po, Pt = FIx^ + HExt + L + L 

(38) 

with 

^i,t = (df — btBt ^Dt) Ea:“ — btBt ^Gt + bt, 
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respectively. 


02,t = {At - dJB t^Dt - dJB t^Dt 

-DjB-^Dt) Exr + {a] - DJB-^bJ) Ep* 

- {Dt + Dty B-^Gt +Ft + Ft, 

where S and w satisfy (24) and (25), and Ea:“ and Ep 
solve (37) and (34), respectively. We assume that the 
following condition holds. 

(A2). There is a constant C > 0 such that 

DjB-^Dt-At-CInxn<0. 


Then (38) has a unique solution (a;“,p, Q) in the space 
.if^yu(0,r;lR"'''”^"^’') by using Theorem 2.6 in Peng 
and Wu [36] again. Similarly, let 

pt = Ttx'l + At (39) 

for two deterministic and differentiable functions T and 
A such that Ft = H and At = 77Ext + L + Z. It follows 
from Ito’s formula that 


dpt 


TtXtdt + Ftdxt + Atdt 


{ [ft + Tt {at - btBt^Dt) - FtbtB, 


~^bjrt 


+rt {di,t — btBt ^bj At) + At I dt 
+ Ft^tfJ (ht^yd-Wf 


Comparing it with the BSDE in (38), we derive 


Step 4: (42) is the optimal control. 


According to (25), Wt is an -adapted standard 
Brownian motion. Then it is easy to see from (43) that 
Xt is -adapted, and hence, Ut given by (42) is - 
adapted. On the other hand, applying Ito’s formula to 
{SF,SF) with Burkholder-Davis-Gundy inequality, we 
deduce 

E sup < - 1 - 00 . 

0 <t<T 

Then u € ^ad- Next, we will prove that Ut is also ~ 
adapted. If so, then u € and consequently, u is 
optimal via Theorem 3.2. In fact, using (25) again, (43) 
can be rewritten as 

' = { [at - btBt^ {Dt + bjFt)] x'l 

~btBt ^bjAt + Oi^t } dt 

< +Etf7iKYK^[dYt- 

- {ftXt + /tEa;]* -b gt) dt] , 

. do = do- 

From the optimal filtering equation, it is easy to check 
that Xt is -adapted, so is Ut- Then u € ^ad- There¬ 
fore, the claim holds. 


r Ft + Ft {at - btB-^Dt) + {aj - DJB-^bJ) Ft 
< - FAB^^bJFt + At- DjBtDt = 0, 

[rT = H 

(40) 

and 

r A, + {aJ - DjBt^bJ - FABi^bJ) At 
I +rt0i,t + 02.t = O, (41) 

[ At = H^x^ + L + L, 

which have a unique solution, respectively. Substituting 
(39) into (32), we get 

Ut = -B-i [{bjFt + Dt) x'i + AEt“ +b]At + Gt] , 

(42) 

where Ea;“, T, A and x^ solve (37), (40), (41) and the 
closed-loop system 

r dx'l= {[at-btBi^DtFbjFt)]x^ 

\ —btBt ^b^ At + di,t) dt + YtfJ (ht dwt^ 

I = do, 

(43) 


Step 5: Optimal cost functional. 


Since the solution S of (24) is independent of v, the 
optimal cost functional is rewritten as 

J[u] = Ji[u] -b [ N^Xo'^tdt -b i /" tr{AtT,t)dt 
Jo ^ Jo 

+ triHYr) 

with 

JiM = ^lFy\{AtX^,x^) + {AtEx^,lEx^} 

-b {BtUt, Ut) -b 2{DtXt , Ut) + 2{DtExt , Ut) 

F 2{Ft,x^) -b 2{Ft, Ex“) -b 2{Gt, Ut)]dt 
+ {Hx^, x^) -b {HEx^, Ex^) -b 2{L, x^) 
+2(Z,Et^)}. 

(44) 

Here solves (43), and tr{A) denotes the trace of the 
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matrix A. Using Ito’s formula, we deduce 
d(rtxr,xr} = {{DjBtDt-At - rtbtB^^bjTt)x^ 

-2TtbtB-^b ]At + 2Tt6i,t, i“) dt 

+ {'BtfJ (V'D dt 

+ {Vt^tfJ{K7^dwt,x'i) 

+ {Ttx'i,^tf7{K7'^dwt) . 


We summarize the above deduction as follows. 

Proposition 4.2. Under the assumptions (Al), (A2), 
M = 0, and jdt = = it = It = It = 0, the opti¬ 

mal feedback control and the optimal cost functional of 
Problem (MFLQ) are explicitly given by (42) and (fi), 
respectively. 


Then 

^{Hxlf,xlf) 

= E / ( {Dj BtDt -At- TthBi^] Ft) x'f 
Jo 

-2FtbtBfX At + 2rt0i,t, xf) dt (45) 

+ / {^tf7iKY,^t^tf7{hfY}dt 

Jo 

+ (roMo, po). 

Similarly, applying Ito’s formula to (A, x“), we get 
lEiHExlf + L + L,xlf) 

= j (At, 9i^t — btBt ^bjAt) dt{Ao,eo) 

— E / {Ftdiy02x,Xt)dt. 

Jo 

Plugging (42), (45) and (46) into (44), we derive 




{2Dt + Dt)^ B^^Dt-At 


-2a7rt] Ex)*, Exf) dt - (i7Ex^, Ex^)} 

+ r (DjBtHJ Pt - rt67 Bt^Gt 

Jo 

-FtbtBt^bJAt, Ex") dt 

+ [ {bjTt,Bi^{bjAt + Gt))dt 

Jo 

+ — / (At, 2bt — 2btBt ^Gt — btBt ^bjAt) dt 

^ Jo 


{Gt,Bf^Gt)dt 


■^{^otJo, Uo) + {Ao,fJ-o) 


+ ll (St/7 (KY, rtSt/7 iK7^) dt 


+ / N^Xoi’tdt - tr{AtP,t)dt 
Jo ^ Jo 


(47) 


5 Solution to Problem (AL) 


In this section, we are interested in explicitly comput¬ 
ing Problem (AL) introduced in Section 1.2 with the as¬ 
sumption that the generator 

G{t,y,z) = Pty + f^tv, 

where [3 and f) are deterministic and uniformly bounded. 
With the generator. Problem (AL) is a special case 
of Problem (MFLQ). (33) is reduced to a decoupled 
forward-backward ODE 

' Ex" = {at at)Ex“ — Bf^b^Ept 

+ NBf^btftte-fo^‘‘'^ + bt, 

Ept = - (at + at)Ept, 

^ ExO = po, EpT = -Ne-fo^ 

Solving it, we get 


Ept 


= —fggJ7dtd'*+J7Ga+°'Ads 


(48) 


and 


Epsds 


Exf = f^oeJ'oG^+dAds _ 7^_i^2g/;(a.+a„)d. 

Jo 

+ (i, + 

(49) 

where Ep is determined by (48). (40) and (41) are rewrit¬ 
ten as 


and 


ft + 2atFt - Bi^blTl = 0, 
Ft = H 


At + (at — Bt ^b^Ft) At -\- FtOi^t + ^2,t — 0, 


\ At = -LIExt - Ne^o 
where 

6»Tt = atEx" -f Bt^bttpt 


(50) 


(51) 


tAe-Z'o 


ds 


+ bt, 02, t — atEpt, 
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with E/? and Ex be determined by (48) and (49). This 
gives 


At = - + Ne-fo ett V,r,)ds 

Then the optimal control strategy in a feedback form is 


ut = -b: 


bt (Ttxt + At) - Ni^te^o 


p^li.ds 


Here F and A solve (50) and (51). The filtered cash- 
balance x“ satisfies a closed-loop system 

r dx'i = [(at - BiHlVt) x'i - Bi^b^kt + 0i.t] dt 

^ ^dwt, 

[ ^0=1^0 

with S being governed by 

r St - 2atEt + VV'S? - c? = 0, 

1 So = CTo- 


The remaining part of this section is devoted to illustrat¬ 
ing the above results via numerical computations. For 
the sake of illustrations, let the time granularity for all 
the market parameters be yearly. Suppose that the ini¬ 
tial investment ^ = 1, the discount rate at = 0.03, the 
liability rate bt = 0.01, and the volatility rates Ct = 0.04 
and ht = 0.1. We further suppose that the market pa¬ 
rameters dt = 0.03, Pt = 0.06, ft = 0.1, bt = ift = Bt = 
iV = 1, and H = 0.01. With the data, it is easy to see 
that 


Ept 

Exr 

Ft 


_ g 0 . 06 ( 2 -t) 


„0.06t 


. 25 

l + t + -e 


0.12 


(1 _g-U.i2t) 


6 

0.03Ex“ -f e° °“ -f 0.01, 6»2,t = -0.03e° °®(2“‘\ 

0 o6e0 06(i-‘) 

5 g0.06(l-t) > 


and 


At = 


- (O.OlEa:)* -f f.o.06yJ^ 

+ ^ (F,0i,, + 


Then the optimal control strategy of the firm is 
Ut = -(Fti" +At- e°-°^‘), 

where the optimal filtering of the cash-balance a;“ satis¬ 
fies 


J dxf = [(0.03 -Tt)xt - At + 9i^t] dt + Etdwt, 
1 x“ = l 

with 

^ 0.08(e° “ - 1) 

* “ eO.ii _ 4 ’ 

F, A and 9i being determined above. 


6 Concluding remarks 


This article studies an optimal control problem for mean- 
field FBSDE with noisy observation. Since mean-field 
FBSDE and optimal filtering are considered, the control 
problem has been basically unexplored so far. The con¬ 
trol problem covers more models in reality, but causes 
a trouble in solving the problem. The backward sepa¬ 
ration method with the decomposition of the state and 
the observation is further developed, and is introduced 
to overcome the resulting difficulty. These results ob¬ 
tained in this article improve the first author’s previous 
works [39,40,41,42], and are helpful for studying mean- 
field game for EBSDE and systematic risk model with 
noisy observation. The details of how we study these 
problems will be presented elsewhere. 


Let us now make several remarks in order to close this 
section. (1) In most optimal control problems for mean- 
field stochastic systems, we assume that all coefficients of 
the optimal control problems are deterministic. Other¬ 
wise, there is an immediate difficulty to study the prob¬ 
lems. One reason is that the key equality E,{atXt) = 
at^xt is no longer true if at is also a stochastic pro¬ 
cess. But some special cases with stochastic coefficients, 
say, Example 2.1, can be solved by a simple reduction 
method. Then it is natural to ask if the method is ap¬ 
plicable for slightly more complicated cases. We hope 
to answer it in the near future. (2) Similar to Exam¬ 
ple 2.2, Problem (MELQ) can also be reduced to an LQ 
problem for non-standard EBSDE with control set con¬ 
straint. This motivates us to investigate such a class of 
LQ problems for non-standard EBSDEs in the future. In 
return, it will be helpful to study LQ problems for mean- 
field EBSDEs. (3) The solution of the BSDE in (7) is a 
non-Gaussian process in general, and thus, the optimal 
filter of the BSDE is infinite. Then it is highly desirable 
to study the numerical approximation of the optimal fil¬ 
ter and the optimal control in our future publications. 
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Appendix 


In a same way, we have 


We present three lemmas first, and then give a proof of 
Theorem 2.1. 

Lemma A.l. For any vj G T; R^), j = 1,2, 

there is a constant C > 0 such that 

E sup < CE f \v\^t —V 2 ,t'\^dt, 

0 <t<T Jo 

E sup \yr-yr\" 

0<t<T 

+ E r {\zr-z^^\^ + \sr-zrndt 
Jo 

<(7 I Ejxy — Xp p + f sup — x^^l'^dt 

\ Jo 0 <s<t 

+1E J |ui,t - ■ 

Proof. These two estimates can be derived by Ito’s 
formula, Gronwall’s inequality and Burkholder-Davis- 
Gundy inequality. We omit the proof for simplicity. □ 

Lemma A.2. For any v, Vj G “Wad {j = 1, 2, • • •) satis¬ 
fying Vj V in T ; R^), it holds 


m^T ) ^ m^T), ^F{yi^) ^ 

with j —>■ + 00 . Then the proof is complete. □ 

Lemma A.3. ‘^ad is dense in 

Proof. For any v G define a family of controls by 

for 0 < t < Sj, 

j- /(fii) 5 , ^sds, for i6j <t<{i + l)(5j, 

where v G U, i, j are natural numbers, 1 < i < j — 1, 
and Sj = T/j. Similar to Bensoussan [2], we can prove 
that (i) Vj G ^ad for any j, and (ii) Vj v as j ^ +oo 
in (0, T; U). Then it implies the desired result. □ 

Proof of Theorem 2.1. From Definition 2.1, we have 
“^ad C and thus, inf„/g^^^ J[v'] > inf^e‘2^„'>^ JM- 
On the other hand, since Vj defined in the proof of 
Lemma A.3 is an element of ‘^ad, then inf„/ €'^ad JW] < 
J[uj], and consequently, it follows from Lemma A.2 that 
inf„/g<^^^ J[v'] < limj_,._i_oo J[vj] = T[u]. Due to the ar¬ 
bitrariness of V, then inf„/g<2qd JW] < 

Thus, the proof is complete. □ 
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